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<110> Valtion teknillinen tutkimuskeskus 

<120> Process for partitioning of molecules 

<130> 31805 

<140> 
<141> 

<160> 42 

<170> Patentln Ver. 2.2 

<210> 1 
<2ll> 428 
<212> DNA 

<213> Trichoderma reesei 
<220> 

<22l> intron 
<222> (167) . . (236) 

<220> 

<221> intron 
<222> 1323) . . (386) 

<220> 

<223> Coding sequence of hfbl 
<400> 1 

atgaagttct tcgccatcgc cgctctcttt gccgccgctg ccgttgccca gcctctcgag 60 
gaccgcagca acggcaacgg caatgtttgc cctcccggcc tcttcagcaa cccccagtgc 120 
tgtgccaccc aagtccttgg cctcatcggc cttgactgca aagtccgtaa gttgagccat 180 
aacataagaa tcctcttgac ggaaatatgc cttctcactc ctttacccct gaacagcctc 240 
ccagaacgtt tacgacggca ccgacttccg caacgtctgc gccaaaaccg gcgcccagcc 3 00 
tctctgctgc gtggcccccg ttgtaagttg atgccccagc tcaagctcca gtctttggca 360 
aacccattct gacacccaga ctgcaggccg gccaggctct tctgtgccag accgccgtcg 420 
gtgcttga 428 



<210> 2 
<2ll> 78 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 2 

tcgggcacta cgtgccagta tagcaacgac tactactcgc aatgccttgt tccgcgtggc 60 
tctagttctg gaaccgca 7 8 
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<210> 3 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3 1 primer 
<400> 3 

tcgtacggat cctcaagcac cgacggcggt 3 0 



<210> 4 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223=> Description of Artificial Sequence: PCR 5' primer 
<400> 4 

actacacgga ggagctcgac gacttcgagc agcccgagct gcacgcagag caacggcaac SO 
ggc 6 3 



<210> 5 
<211> 2211 
<212> DNA 

<213> Trichoderma reesei 



<220> 

<221> promoter 

<222> (1) . . (2211) 

<223> cbhl promoter sequence 



<400> 5 

gaattctcac ggtgaatgta ggcctcttgt 
acctccatta cgcctccccc atagagttzcc 
tagattgggg agaagttgac ttccgcccag 
ggtcggcaac ggcaaaaaag cacgtggctc 
tccaggaacc tggatacatc catcatcacg 
tattcgccct aaaccgaagt gcgtggtaaa 
gtgtgtcttc tctaggtgca ttctttcctt 
tccgagctgt aactacctct gaatctctgg 
tgcatcatgt atataatagt gatcctgaga 
tggtcatcaa acaaagaacg aagacgcctc 
agaactggat acttgttgtg tcttctgtgt 
tctattcaaa caccaagctt gctcttttga 
agttgtgaag tcggtaatcc cgctgtatag 
ccgctgcgaa cccggagaat cgagatgtgc 
catgaaaggc tatgagaaat tctggagacg 
acaagcaaag cgttccgtcg cagtagcagg 
taagtagcga tggaaccgga ataatataat 
caatgcaggg gtactgagct tggacataac 
ggcgtttccc tgattcagcg tacccgtaca 
ggacgtgttt tgcccttcat ttggagaaat 
tgaccgactg gggctgttcg aagcccgaat 
aggcatgttg tgaatctgtg tcgggcagga 



agggtaggaa ttgtcactca agcaccccca 60 
caatcagtga gtcatggcac tgttctcaaa 120 
agctgaaggt cgcacaaccg catgatatag 180 
accgaaaagc aagatgtttg cgatctaaca 240 
cacgaccact ttgatctgct ggtaaactcg 300 
tctacacgtg ggcccctttc ggtatactgc 360 
cctctagtgt tgaattgttt gtgttgggag 420 
agaatggtgg actaacgact accgtgcacc 480 
aggggggttt ggagcaatgt gggactttga 540 
ttttgcaaag ttttgtttcg gctacggtga 600 
atttttgtgg caacaagagg ccagagacaa 660 
gctacaagaa cctgtggggt atatatctag 720 
taatacgagt cgcatctaaa tactccgaag 780 
tggaaagctt ctagcgagcg gctaaattag 840 
gcttgttgaa tcatggcgtt ccattcttcg 900 
cactcattcc cgaaaaaact cggagattcc 960 
aggcaataca ttgagttgcc tcgacggttg 1020 
tgttccgtac cccacctctt ctcaaccttt 1080 
agtcgtaatc actattaacc cagactgacc 1140 
aatgtcattg cgatgtgtaa tttgcctgct 1200 
gtaggattgt tatccgaact ctgctcgtag 1260 
cacgcctcga aggttcacgg caagggaaac 1320 
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caccgatagc agtgtctagt agcaacctgc aaagccgcaa tgcagcatca ctggaaaata 1380 
caaaccaatg gctaaaagta cataagttaa tgcctaaaga agtcatatac cagcggctaa 1440 
taattgtaca atcaagtggc taaacgtacc gcaatctgcc aacgcgttgt ggggttgcag 1500 
aagcaacggc aaagcccact tcccacgttt gtttcttcac tcagtccaat cecagctggt 1560 
gatcccccaa ttgggtcgct cgcctgtccc ggtgaagtga aagaagacag aggtaagaaC 1620 
gtctgactcg gagcgtctcg catacaacca agggcagtga tggaagacag tgaaatgtcg 1680 
acaCtcaagg agtattcagc cagggatgct tgagtgtatc gtgtaaggag gttcgtctgc 1740 
cgacacgacg aatactgtat agtcacttct gatgaagtgg tccatatega aatgtaagcc 1800 
ggcactgaac aggcaaaaga ctgagtcgaa actgcctaag atctcgggcc ctcgggcttc 1860 
ggctttgggt gtacatgttt gtgctccggg caaacgcaaa gtgtggcagg accgaca'-ac 1920 
tgctgcctct accaagcagc tgagggtatg tgataggcaa atgttcaggg gccactgcat 1980 
ggtttcgaat agaaagagaa gcttagccaa gaacaatagc cgataaagat agcctcatta 2040 
aacgaaacga gctagtaggc aaagccagcg aatgtgcata tataaaggnt cgaggtccgc 2100 
gcctccctca tgctctcccc atctactcat caacccagat cctccaggag acCtgtacac 2160 
catcttttga ggcacagaaa cccaatagtc aaccgcggac tgcgcatcat g" 2211 



<210> 6 
<211> 1588 
<212> DNA 

<213> Trichodezwa reesei 
<220> 

<223> T. reesei egll cDNA 
<400> 6 

cccccctatc ttagtccttc ttgttgtccc aaaatggcgc cctcagttac actgccgttg 60 
accacggcca tcctggccat tgcccggctc gtcgccgccc agcaaccggg taccagc?/:c 120 
cccgaggtcc atcccaagtt gacaacctac aagtgtacaa agtccggggg gtgcgtggcc 180 
caggacacct cggtggtccC tgactggaac taccgctgga tgcacgacgc aaactacaac 24 0 
tcgcgcaccg tcaacggcgg cgtcaacacc acgctctgcc ctgacgaggc gacctgtggc 300 
aagaactgct tcatcgaggg cgtcgactac gccgcctcgg gcgtcacgac ctcgggcagc 360 
agccccacca cgaaccagta catgcccagc agctctggcg gctacagcag cgtctctcct 420 
cggctgtatc tcctggactc tgacggtgag cacgtgatgc tgaagctcaa cggccaggag 480 
ccgagcttcg acgtcgacct ctctgctctg ccgtgtggag agaacggctc gctctacctg 540 
tctcagatgg acgagaacgg gggcgccaac cagtacaaca cggccggtgc caactacggg 600 
agcggctact gcgatgctca gtgccccgtc cagacatgga ggaacggcac ccccaacact 660 
agccaccagg gcttctgctg caacgagatg gatatcctgg agggcaactc gagggcgaat 720 
gccttgaccc ctcactcttg cacggccacg gcctgcgact ctgccggttg cggcttcaac 780 
ccctatggca gcggctacaa aagctactac ggccccggag ataccgttga cacctccaag 84 0 
accttcacca tcatcaccca gttcaacacg gacaacggct cgccctcggg caaccttgtg 900 
agcatcaccc gcaagtacca gcaaaacggc gtcgacatcc ccagcgccca gcccggcggc 960 
gacaccatcc cgtcctgccc gtccgcctca gcctacggcg gcctcgccac cacgggcaag 1020 
gccctgagca gcggcatggt gcccgtgttc agcatttgga acgacaacag ccagtacatg 108 0 
aaccggcccg acagcggcaa cgccggcccc tgcagcagca ccgagggcaa cccacccaac 1140 
atcctggcca acaaccccaa cacgcacgtc gtcctctcca acatccgccg gggagac . .t 1200 
gggcctacca cgaactcgac tgcgcccccg cccccgcctg cgtccagcac gacgttctcg 1260 
actacacgga ggagctcgac gacttcgagc agcccgagct gcacgcagac tcactggggg 1320 
cagtgcggtg gcattgggca cagcgggtgc aagacgcgca cgccgggcac tacgtgccag 1380 
tatagcaacg actactactc gcaatgcctt tagagcgttg acttgcctct ggtctgtcca 1440 
gacgggggca cgatagaatg cgggcacgca gggagctcgt agacatcggg cttaatatat 1500 
aagacatgct atgttgtatc tacattagca aatgacaaac aaatgaaaaa gaacttatca 1560 
agcaaaaaaa aaaaaaaaaa aaaaaaaa 1588 
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<210> 7 
<211> 745 
<212> DNA 

<213> Trichoderma reesel 
<220> 

<221> cerminator 
<222> (1) . . (745) 
<223> T. reesei cbhl terminator 

<400> 7 

ggacctaccc agtctcacta cggccagtgc ggcggtattg gctacagcgg ccccacggtc 60 
tgcgccagcg gcacaacttg ccaggtcctg aacccttact actctcagtg cctgtaaagc 120 
tccgtgcgaa agcctgacgc accggtagat tcttggtgag cccgtatcat gacggcggcg 180 
ggagctacat ggccccgggt gatttatttt ttttgtatct acttctgacc cttttcaaat 24 0 
atacggtcaa ctcatctttc actggagatg cggcctgctt ggtattgcga tgttgtcagc 300 
ttggcaaatt gtggctttcg aaaacacaaa acgattcctt agtagccatg cattttaaga 36 0 
taacggaaza gaagaaagag gaaattaaaa aaaaaaaaaa aacaaacatc ccgttcataa 420 
cccgtagaat cgccgctctt cgtgtatccc agtaccacgt caaaggtact catgatcgtt 480 
caatgttgat attgttccgc cagtatggct ccacccccat ctccgcgaat ctcctcttct 540 
cgaacgcggt agtggctgct gccaattggt aatgaccata gggagacaaa cagcataata 600 
gcaacagtgg aaattagtgg cgcaataatt gagaacacag tgagaccata gctggcggcc 660 
tggaaagcac cgttggagac caacttgtcc gttgcgaggc caacttgcat tgctgtcaag 72 0 
acgatgacaa cgtagccgag gaccc 745 



<210> 8 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: annealed primer 
<400> 8 

taaccgcggt l'o 



<210> 9 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: annealed primer 
<400> 9 

ctagaccgcg gttaat 16 



<210> 10 
<211> 1232 
<212> DMA 

<213> Trichoderma reesei 
<220> 

<221> promoter 
<222> (1) . . (1232) 
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<223> T. reesei gpdl promoter 
<400> 10 

gtcgacacga tatacaggcg cggctgatga taatgatgat cgagcatgac ttgatgctgt 60 
atgtgacaat attgactgcg aggaaccatc aggtgtgtat ggatggaatc attctgtaa:: 120 
caccaaggtg catgcatcae aaggattctc cccagctcac caacaacgaa cgatggccat 180 
gttagtgaag gcaccgtgat ggcaagatag aaccactatt gcatctgcgc ttcccacgca 240 
cagtacgtca agtaacgtca aagccgccct cccgtaacct cgcccgttgt cgctccc jc 300 
gattgcctca atcacatagt acctacctat gcattatggg cggcctcaac ccaccccccc 360 
agatcgagag ctaccttaca tcaatatggc cagcacctct tcggcgatac atactcgcca 420 
ccccagccgg cgcgattgtg tgtactaggt aggctcgtac tataccagca ggagaggtgc 480 
tgcttggcaa tcgtgctcag ctgttaggtt gtacttgtat ggtacttgta aggtggtcat 540 
gcagttgcta aggtacctag ggagggattc aacgagccct gcttccaatg tccatctgga 600 
taggatggcg gctggcgggg ccgaagctgg gaactcgcca acagtcatat gtaatagctc 660 
aagttgatga taccgttttg ccagattaga cgcgagaagc agcatgaatg tcgctcatcc 720 
gatgccgcat caccgttgtg tcagaaacga ccaagctaag caactaaggt accttaccgt 780 
ccactatctc aggtaaccag gtactaccag ctaccctacc tgccgtgcct acctgcttta 840 
gtgttaatct ttccacctcc ctcctcaatc ttcttttccc tcctctcctc CttttCttCC 900 
cttcctcctc ttcttctcca taaccattcc taacaacatc gacattctct cctaatcacc 960 
agcctcgcaa atcctcagtt tgtatgtacg tacgtactac aatcatcacc acgatcgtcc 1020 
gcccgacgat gcggcttctg ttcgcctgcc cctcctctca ctcgtgccct cgacgagcta 1080 
gccccgccag gactctcctg cgtcaccaat tcttttccct atttacccct cctccccctc 1140 
tccctctcgt ttcttcctaa caaacaacca ccaccaaaat ctctttggaa gctcacgact 1200 
cacgcagctc aatccgcaga tacaaatcta ga 1232 



<210> 11 
<211> 1129 
<212> DNA 

<213> Trichoderma reesei 
<220> 

<221> terminator 

<222> (1) . . (1129) 

<223> T. reesei gpdl terminator 

<400> 11 

ggatcccgag cattgtctat gaatgcaaac aaaaatagta aataaatagt aattctggcc 60 
atgacgaata gagccaatct gctccacttg actatcttgt gactgtatcg tatgtcgaac 120 
ccttgactgc ccattcaaac aattgtaaag gaatatagct acaagttatg tctcacgttt 1B0 
gcgtgcgagc ccgtttgtac gttattttga gaaagcgttg ccatcacatg ctcacagtca 240 
cttggcttac gatcatgttt gcgatcttcg gtaagaatac acagagtaac gattatctcc 300 
atcgcttcta tgattaggta ctcagacaac acatgggaaa caagataacc atcgcatgca 360 
aggtcgattc caatcatgat ctggactggg gtattccatc taagccatag taccctc^g 420 
agaaggaatg gtaggacctc tcaggcgtcc accatctgtg ctgcaaatcc aagaaacccc 480 
ccaaaagcac ctacctatct acctagagta actgcacgag aaaagaaaag gagcagaaga 540 
agaatgatct caagaggccg tgaacgcaga aacacactcc tcccaacttt tcaagttttg 600 
aacaaaaaaa gaaagatgag gactagaaga tggagtattt ccttcttaga gagctctcgg 660 
tgaggtgacc tgtcagggtt taccgcaaac cgtcggtggt tctatccaat taatcaagtc 720 
ccgcgcctcg cctcttctct cctgtccttt catagaatcc cgtctccttg ttgcttgatc 780 
gaagcggggt tatcgacgcc accaaagatc ttgtcttggt gacttatcaa tcctttggtg 840 
atcaaacagc ccccgagtga tcagatccgt aaaagaagaa gaagagtacg atttaaccag 900 
accgaggaac aataaagcga gtaaataaca tcaaaacaag agtctcgttg aaaattactt 960 
gttcctcaat caatcccaac ccccctaaaa gcccttcccc ccatggtata tcccggcagt 1020 
aggagagaga tatttccact accgctcacc accaagtgag gcttgccgag agaagaggat 1080 
gaatcagaag tgacaacaac gggttgagca catgggatat cggcgcgcc 1129 
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<210> 12 
<:211> 5733 
<212> DNA 

<213> Aspergillus nidulans 
■=220> 

«l223> (1-5733) Sequence of plasmid pAN52-l 
i220> 

<22l> promoter 

<222> ( 1) . . (2129) 

<223> A. nidulans gpdA promoter 

<220> 

<22i> gene 

^222> (2130) . . (2304) 

<;223> A. nidulans gpdA gene 

<220> 

*221> terminator 

<222> (2305) . . (3071) 

•:223> A, nidulans trpC terminator 

<220> 

<221> misc_f eature 

<222> (3072) . . (5726) 

<223> pUCIS from Sail to EcoRI 

<400> 12 

caattccctt gtatccctac acacaggctc aaatcaataa gaagaacggt tcgtctttct 60 
cgtttatatc ttgcatcgtc ccaaagctat tggcgggata ttctgtttgc agttggctga 120 
cttgaagtaa tctctgcaga tctttcgaca ctgaaatacg tcgagcctgc tccgcttgga 18 0 
agcggcgagg agcctcgtcc tgtcacaact accaacatgg agtacgataa gggccagttc 24 0 
cgccagctca ttaagagcca gttcatgggc gctggcatga tggccgtcat gcatctgtac 300 
ttcaagtaca ccaacgctct tctgatccag tcgatcatcc gctgaaggcg ctttcgaatc 360 
tggttaagat ccacgtcttc gggaagccag cgactggtga cctccagcgt ccctttaagg 420 
gtgccaacag ctttctcagc cagggccagc ccaagaccga caaggcctcc ctccagaacg 480 
ccgagaagaa ctggaggggt ggtgtcaagg aggagtaagc tccttattga agtcggagga 540 
cggagcggtg tcaagaggat attcttcgac tctgtattat agataagatg atgaggaatt 600 
ggaggtagca tagcttcatt tggatttgct ttccaggctg agactctagc ttggagcata 660 
gagggtcctt tggctttcaa tattctcaag tatctcgagt tcgaacttat tccctgtgaa 720 
ccttttattc accaatgagc attggaatga acatgaatct gaggactgca atcgccatga 780 
ggttttcgaa atacatccgg atgtcgaagg cttggggcac ctgcgttggt tgaatttaga 840 
acgtggcact attgatcatc cgatagctct gcaaagggcg ttgcacaatg caagtcaaac 900 
gttgctagca gttccaggtg gaatgttatg atgagcattg tatcaaatca ggagatatag 960 
catgatctct agttagctca ccacaaaagt cagacggcgt aaccaaaagt cacacaacac 1020 
aagctgtaag gatttcggca cggctacgga agacggagaa gccaccttca gtggactcga 1080 
gtaccattta attctatttg tgtttgatcg agacctaata cagcccctac aacgaccatc 1140 
aaagtcgtat agctaccagt gaggaagtgg actcaaatcg acttcagcaa catctcctgg 1200 
ataaacttta agcctaaact atacagaata agataggtgg agagcttata ccgagctccc 1260 
aaatctgtcc agatcatggt tgaccggtgc ctggatcttc ctatagaatc atccttattc 1320 
gttgacctag ctgattctgg agtgacccag agggtcatga cttgagccta aaatccgccg 1380 
cctccaccat ttgtagaaaa atgtgacgaa ctcgtgagct ctgtacagtg accggtgact 1440 
ctttctggca tgcggagaga cggacggacg cagagagaag ggctgagtaa taagccactg 1500 
gccagacagc tctggcggct ctgaggtgca gtggatgatt attaatccgg gaccggccgc 1560 
ccccccgccc cgaagtggaa aggctggtgt gcccctcgtt gaccaagaat ctattgcatc 1620 
atcggagaat atggagcttc atcgaatcac cggcagtaag cgaaggagaa tgtgaagcca 1680 
ggggtgtata gccgtcggcg aaatagcatg ccattaacct aggtacagaa gtccaattgc 174 0 
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ttccgatctg 
ccggcgcgca 
ctctcctgct 
cagaccggga 
gtccctcagt 
tgacaaggtc 
ctgctctttt 
acctttattt 
attcctttga 
cccgcttgag 
gacgaatctg 
gttcaggatc 
tttaatagct 
agctcatctg 
cgaagagaag 
caacggtcgc 
tatttgtctc 
aaaat^tccgt 
tcaagcctac 
atggtacaca 
ccggccgaaa 
ttgtttagag 
catctccact 
gactgggaaa 
agctggcgta 
aatggcgaat 
cgcatatggt 
cacccgccaa 
agacaagctg 
aaacgcgcga 
ataatggttt 
tgtttatttt 
atgcttcaat 
attccctttt 
gtaaaagatg 
agcggtaaga 
aaagctctgc 
cgccgcatac 
cttacggatg 
actgcggcca 
cacaacatgg 
ataccaaacg 
ctattaactg 
gcggataaag 
gataaatctg 
ggtaagccct 
cgaaatagac 
caagttcact 
taggtgaaga 
cactgagcgt 
cgcgtaatct 
gatcaagagc 
aatactgtcc 
cctacatacc 
tgtcttaccg 
acggggggtt 
ctacagcgtg 
ccggtaagcg 



gtaaaagatt 
agccccctaa 
ttgcccggtg 
acacaagctg 
ccctggtagg 
gttgcgccag 
cttttctctt 
cccctaagta 
acctttcagt 
cagacatcac 
gatataagat 
tcgataagat 
ccatgtcaac 
caatgcatta 
aatagcttag 
caagagacct 
taattgtact 
caccagcncc 
aggacacaca 
atagtaacca 
cttttttaca 
gtaatccctc 
cgacctgcag 
accctggcgt 
atagcgaaga 
ggcgcctgat 
gcactctcag 
cacccgctga 
tgaccgtctc 
gacgaaaggg 
cttagacgtc 
tctaaataca 
aatattgaaa 
ttgcggcatt 
ctgaagatca 
tccttgagag 
tatgtggcgc 
actattctca 
gcatgacagt 
acttacttct 
gggatcatgt 
acgagcgtga 
gcgaactact 
tcgcaggacc 
gagccggtga 
cccgtatcgt 
agatcgctga 
catatatact 
tcctctttga 
cagaccccgt 
gctgcttgca 
taccaactct 
tcctagtgta 
tcgctctgct 
ggttggactc 
cgtgcacaca 
age tat gaga 
geagggtegg 



cacgagatag 
ttggcccatc 
tatgaaaccg 
gcagtcgacc 
cagctttgcc 
tccaacatcc 
tcttttccca 
agtactttgc 
tcgagctttc 
catggatcca 
cgttggtgtc 
aegttcaett 
aagaataaaa 
atgeattgae 
cagagctatt 
acgagactga 
ttgacatget 
tgggttcgca 
ttcategtag 
tgcatggttg 
actctcctat 
cttctagaag 
geatgeaage 
tacccaactt 
ggcccgcacc 
gcggtatttt 
tacaatctgc 
cgcgccctga 
egggagctge 
ectegtgata 
aggtggcact 
ttcaaatatg 
aaggaagagt 
ttgccttcct 
gttgggtgca 
ttttcgcccc 
ggtattatcc 
gaatgacttg 
aagagaatta 
gacaacgatc 
aactcgcctt 
caccacgatg 
tactctagct 
acttctgcgc 
gcgtgggtct 
agttatctac 
gataggtgee 
ttagattgat 
taatctcatg 
agaaaagatc 
aacaaaaaaa 
ttttccgaag 
geegtagtta 
aatcctgtta 
aagacgatag 
gcccagcttg 
aagcgccacg 
aacaggagag 



taccttctcc 
eggcatctgt 
gaaaggcege 
catccggtgc 
ccgtctgtcc 
gttgecatat 
tcttcagtat 
tacatccata 
ccacctcatc 
ettaaegtta 
gatgtcagct 
gtccaagcag 
cgcgttttcg 
cgcaacctag 
ttcattttcg 
ggaatccget 
cctcttcttt 
aagataattg 
gtataaacct 
cctagtgaat 
gagtegttta 
tcctcgtgta 
ttggcactgg 
aatcgccttg 
gatcgccctt 
ctccttacgc 
tetgatgecg 
cgggctcgtc 
atgtgtcaga 
cgcctatttt 
tttcggggaa 
tatccgctca 
atgagtattc 
gtttttgetc 
cgagtgggtt 
gaagaacgtt 
cgtattgacg 
gttgagtact 
tgcagtgctg 
ggaggaccga 
gatcgttggg 
cctgtagcaa 
tcccggcaac 
ccggcccttc 
cgeggtatea 
aegaegggga 
tcactgatta 
ttaaaacttc 
accaaaatcc 
aaaggatctt 
ccaccgctac 
gtaactggct 
ggccaccact 
ccagtggctg 
ttaceggata 
gagegaaega 
cttcccgaag 
cgcacgaggg 



gaagtaggta 


gagegagtae 


1800 


agggegtcca 


aatatcgtgc 


1860 


tcaggagctg 


gccagcggcg 


1920 


tctgcactcg 


acctgccgag 


1980 


gcccggtgtg 


ccggcggggt 


2 04 0 


tttcctgctc 


tccccaccag 


2100 


attcatcttc 


ccatccaaga 


2160 


ctccatcctt 


cccatccct t 


2220 


gcagcttgac 


taacagctac 


2280 


ctgaaatcat 


caaacagctt 


2340 


ccggagctga 


gacaaatggt 


2400 


caaagagtgc 


cttctagtga 


2460 


ggtttacctc 


ttccagatac 


2520 


taacgccttn 


caggctccgg 


2580 


ggagacgaga 


tcaagcagat 


2640 


cttggctcca 


cgegactata 


2700 


actctgatag 


cttgactatg 


2760 


catgtttctt 


ccttgaactc 


2820 


cgaaatcant 


tcctactaag 


2880 


gctccgtaac 


acccaatacg 


2940 


cccagaatgc 


acaggtacac 


3000 


ctgtgtaagc 


gcccactcca 


3060 


ccgtcgtttt 


acaaegtegt 


3120 


cagcacatcc 


ccctttcgcc 


3180 


cccaacagtt 


gcgcagcctg 


3240 


atctgtgcgg 


tatttcacac 


3300 


catagttaag 


ccagccccga 


33G0 


tgctcccggc 


atccgcttac 


3420 


ggttttcacc 


gtcatcaccg 


3 480 


tataggttaa 


tgtcatgata 


3540 


atgtgcgcgg 


aacccctatt 


3600 


tgagacaata 


accctgataa 


3660 


aacatttccg 


tgtcgccctt 


3720 


acccagaaac 


gctggtgaaa 


3780 


acatcgaact 


ggatctcaac 


3840 


ttccaatgat 


gagcactttt 


3900 


cegggcaaga 


geaacteggt 


3960 


caccagtcac 


agaaaagcat 


4020 


ccataaccat 


gagtgataac 


4080 


aggagctaac 


cgcttttttg 


4140 


aaceggaget 


gaatgaagee 


4200 


tggcaacaac 


gttgcgcaaa 


4260 


aattaataga 


ctggatggag 


4320 


cggccggctg 


gtteattget 


4380 


ttgeagcact 


ggggecagat 


4440 


gtcaggcaac 


tatggatgaa 


4500 


agcattggta 


actgtcagac 


4560 


atttttaatc 


taaaaggatc 


4620 


cttaacgtga 


gttttcgttc 


4680 


cttgagatcc 


tettettctg 


4740 


cagcggtggt 


ttgtttgccg 


4800' 


ccagcagagc 


gcagatacca 


4860 


tcaagaacrtc 


tgtagcaccg 


4920 


ctgccagtgg 


cgataagtcg 


4980 


aggegcageg 


gtcgggctga 


5040 


cctacaccga 


actgagacac 


5100 


ggagaaaggc 


ggacaggtat 


5160 


agcttccagg 


gggaaacgee 


5220 



• 
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tggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga 5280 

tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc 534G 

ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg 5400 

gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag 5460 

cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gccccccccc 5520 

gcgcgttggc cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc 5580 

agcgagcgca acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac 5640 

tttatgcttc cggcccgtat gttgtgtgga attgtgagcg gataacaact tcacacagga 5700 

aacagctatg accatgatta cgaattgcgg ccg 5733 



<210> 13 
<211> 39 
<2X2> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 13 

gtcaaccgcg gactgcgcat catgaagttc ttcgccatc 39 



<210> 14 
<211> 66 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of .Artificial Sequence: PCR 5' primer 
<400> 14 

tctagcaagc ttggctctag ttctggaacc gcaccaggcg gcagcaacgg caacggcaat 60 
gtttgc 66 



<210> 15 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 15 

tcgtacaagc tttcaagcac cgacggcggt 3 0 



<210> 16 
<211> 33 
<212> DNA 

<2l3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 



<400> 16 

tctagctcta gaagcaacgg caacggcaat gtt 



33 



WO 00/58342 



9 



PCT/FIOO/00249 



<210> 17 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 17 

tgctagtcga cctgctagca gcaccgacgg cggtctg 37 



<210> 18 
<211> 4614 
<212> DNA 

<213> Saccharomyces cerevisiae 
<220> 

<223> S. cerevisiae FLOl coding sequence 
<400> IB 

atgacaatgc ctcatcgcta tatgtttttg gcagtcttta cacttctggc actaactagt 60 
gtggcctcag gagccacaga ggcgtgctta ccagcaggcc agaggaaaag tgggatgaat 12 0 
ataaattttt accagtattc attgaaagat tcctccacat attcgaatgc agcatatatg 180 
gcttatggat atgcctcaaa aaccaaacta ggttctgtcg gaggacaaac tgatatctcg 240 
attgattata atattccctg tgttagttca tcaggcacat ttccttgtcc tcaagaagat 300 
tcctatggaa actggggatg caaaggaatg ggtgcttgtt ctaatagtca aggaattgca 360 
tactggagta ctgatttatt tggtttctat actaccccaa caaacgtaac cctagaaatg 420 
acaggttatt ttttaccacc acagacgggt tcttacacat tcaagtttgc tacagttgac 460 
gactctgcaa ttctatcagt aggtggtgca accgcgttca actgttgtgc tcaacagcaa 540 
ccgccgatca catcaacgaa ctttaccatt gacggtatca agccatgggg tggaagtttg 600 
ccacctaata tcgaaggaac cgtctatatg cacgctggct actattatcc aatgaaggtc 660 
gtttactcga acgctgtttc ttggggtaca cttccaatta gtgtgacact tccagatggt 720 
accactgtaa gtgatgactt cgaagggtac gtctattcct ttgacgatga cctaagtcaa 780 
tctaactgta ctgtccctga cccttcaaat tatgctgtca gtaccactac aactacaacg 84 0 
gaaccatgga ccggtacttt cacttctaca tctactgaaa tgaccaccgt caccggtacc 900 
aacggcgttc caactgacga aaccgtcatt gtcatcagaa ctccaacaac tgctagcacc 960 
atcataacta caactgagcc atggaacagc acttttacct ctacttctac cgaattgacc 1020 
acagtcactg gcaccaatgg tgtacgaact gacgaaacca tcattgtaat cagaacacca 1060 
acaacagcca ctactgccat aactacaact gagccatgga acagcacttt tacctctact 1140 
tctaccgaat tgaccacagt caccggtacc aatggtttgc caactgatga gaccatcatt 1200 
gtcatcagaa caccaacaac agccactact gccatgacta caactcagcc atggaacgac 1260 
acttttacct ctacatccac tgaaatgacc accgtcaccg gtaccaacgg tttgccaact 1320 
gatgaaacca tcattgtcat cagaacacca acaacagcca ctactgctat gactacaact 13B0 
cagccatgga acgacacttt tacctctaca tccactgaaa tgaccaccgt caccggtacc 1440 
aacggtttgc caactgatga aaccatcatt gtcatcagaa caccaacaac agccactact 1500 
gccatgacta caactcagcc atggaacgac acttttacct ctacatccac tgaaatgacc 1560 
accgtcaccg gtaccaatgg tttgccaact gatgagacca tcattgtcat cagaacacca 1620 
acaacagcca ctactgccat gactacaact cagccatgga acgacacttt tacctctaca 1680 
tccactgaaa tgaccaccgt caccggtacc aacggtttgc caactgatga aaccatcatt 1740 
gtcatcagaa caccaacaac agccactact gccataacta caactgagcc atggaacagc 1800 
acttttacct ctacttctac cgaattgacc acagtcaccg gtaccaatgg tttgccaact I860 
gatgagacca tcattgtcat cagaacacca acaacagcca ctactgccat gactacaact 1920 
cagccatgga acgacacttt tacctctaca tccactgaaa tgaccaccgt caccggtacc 1980 
aacggtttgc caactgatga aaccatcatt gtcatcagaa caccaacaac agccactact 2040 
gccatgacta caactcagcc atggaacgac acttttacct ctacatccac tgaaatgacc 2100 
accgtcaccg gtaccaacgg tttgccaact gatgagacca tcattgtcat cagaacacca 2160 
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acaacagcca ctactgccat gactacaact cagccatgga acgacacttt tacctctaca 2220 
tccactgaaa tgaccaccgt caccggtacc aacggcgttc caactgacga aaccgtcatt 2280 
gtcatcagaa ctccaactag tgaaggtcta atcagcacca ccactgaacc atggactggt 234 0 
actttcacct ctacatccac tgagatgacc accgtcaccg gtactaacgg tcaaccaact 2400 
gacgaaaccg tgattgttat cagaactcca accagtgaag gtttggttac aaccaccact 2460 
gaaccatgga ctggtacttt tacttctaca tctactgaaa tgaccaccat tactggaacc 2520 
aacggcgttc caactgacga aaccgtcatt gtcatcagaa ctccaaccag tgaaggtcta 25S0 
atcagcacca ccactgaacc atggactggt acttttactt ctacatctac tgaaatgacc 264 0 
accattactg gaaccaatgg tcaaccaact gacgaaaccg ttattgttat cagaactcca 2700 
actagtgaag gtctaatcag caccaccact gaaccatgga ctggtacttt cacttctaca 2760 
tctactgaaa tgaccaccgt caccggtacc aacggcgttc caactgacga aaccgtcatt 2820 
gtcatcagaa ctccaaccag tgaaggtcta atcagcacca ccactgaacc atggactggc 2880 
actttcactt cgacttccac tgaggttacc accatcactg gaaccaacgg tcaaccaact 2940 
gacgaaactg tgattgttat cagaactcca accagtgaag gtctaatcag caccaccact 3000 
gaaccatgga ctggtacttt cacttctaca tctgctgaaa tgaccaccgt caccggtact 306 0 
aacggtcaac caactgacga aaccgtgatt gttatcagaa ctccaaccag tgaaggtttg 3120 
gttacaacca ccactgaacc atggactggt acttttactt cgacttccac tgaaatgtct 3180 
actgtcactg gaaccaatgg cttgccaact gatgaaactg tcattgttgt caaaactcca 324=0 
actactgcca tctcatccag tttgtcatca tcatcttcag gacaaatcac cagctctatc 3300 
acgtcttcgc gtccaattat taccccattc tatcctagca atggaacttc tgtgatttct 3360 
tcctcagtaa tttcttcctc agtcacttct tctctattca cttcttctcc agtcatttct 3420 
tcctcagtca tttcttcttc tacaacaacc tccacttcta tattttctga atcatctaaa 34&0 
tcatccgtca ttccaaccag tagttccacc tctggttctt ctgagagcga aacgagttca 3540 
gctggttctg tctcttcttc ctcttttatc tcttctgaat catcaaaatc tcctacatat 3600 
tcttcttcat cattaccact tgttaccagt gcgacaacaa gccaggaaac tgcttcttca 3660 
ttaccacctg ctaccactac aaaaacgagc gaacaaacca ctttggttac cgtgacatcc 3720 
tgcgagtctc atgtgtgcac tgaatccatc tcccctgcga ttgtttccac agctactgtt 3780 
actgttagcg gcgtcacaac agagtatacc acatggtgcc ctatttctac tacagagaca 3840 
acaaagcaaa ccaaagggac aacagagcaa accacagaaa caacaaaaca aaccacggta 3900 
gttacaattt cttcttgtga atctgacgta tgctctaaga ctgcttctcc agccattgta 3960 
tctacaagca ctgctactat taacggcgtt actacagaat acacaacatg gtgtcctatt 4 02 0 
tccaccacag aatcgaggca acaaacaacg ctagttactg ttacttcctg cgaatctggt 4080 
gtgtgttccg aaactgcttc acctgccatt gtttcgacgg ccacggctac tgtgaatgat 414 0 
gttgttacgg tctatcctac atggaggcca cagactgcga atgaagagtc tgtcagctct 4200 
aaaatgaaca gtgctaccgg tgagacaaca accaatactt tagctgctga aacgactacc 426 0 
aatactgtag ctgctgagac gattaccaat actggagctg ctgagacgaa aacagtagtc 4320 
acctcttcgc tttcaagatc taatcacgct gaaacacaga cggcttccgc gaccgatgtg 4380 
attggtcaca gcagtagtgt tgtttctgta tccgaaactg gcaacaccaa gagtctaaca 4440 
agttccgggt tgagtactat gtcgcaacag cctcgtagca caccagcaag cagcatggta 4 500 
ggatatagta cagcttcttt agaaatttca acgtatgctg gcagtgccaa cagcttactg 4560 
gccggtagtg gtttaagtgt cttcattgcg tccttattgc tggcaattat ttaa 4614 



<210> 19 
<211> 5857 
<212> DMA 

<213> Sac char omyces cerevisiae and B. coli 
<220> 

<221> promoter 
<222> (1) . . (452) 

<223> S. cerevisiae GAL1 promoter 
<220> 

<223> (476-495) E. coli T7 promoter/priming site 
<220> 

<223> (502-601) E. coli multiple cloning site 
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<220> 

<223> (609-857) S. cerevisiae CYC1 transcription terminator 
<220> 

<223> (1039-1712) E. coli pMBl (pUC-derivedj origin 
<220> 

<221> gene 

<222> (1857) . . (2717) 

<223> E. coli ampicillin resistance gene 
<220> 

<221> gene 

<222> (2735) . . (3842) 

<223> S. cerevisiae URA3 gene 

<220> 

<223> (3846-5317) s. cerevisiae 2 micron origin 
<220> 

<223> (5385-5840) E. coli fl origin 
<220> 

<223> (1-5857) Sequence of pYES2 
<400> 19 

acggattaga agccgccgag cgggtgacag ccctccgaag gaagactctc ctccgtgcgt 60 
cctcgtcctc accggtcgcg ttcctgaaac gcagatgtgc ctcgcgccgc actgctccga 120 
acaataaaga ttctacaata ctagctttta tggttatgaa gaggaaaaat tggcagtaac 180 
ctggccccac aaaccttcaa atgaacgaat caaattaaca accataggat gataatgcga 240 
ttagtttttt agccttattt ctggggtaat taatcagcga agcgatgatt tttgatctat 300 
taacagatrat ataaatgcaa aaactgcatt aaccacttta actaatactt tcaacatttt 360 
cggtttgtat tacttcttat tcaaatgtaa taaaagtatc aacaaaaaat tgttaatata 420 
cctctatact ttaacgtcaa ggagaaaaaa ccccggatcg gactactagc agctgtaata 480 
cgactcacta tagggaatat taagcttggt accgagctcg gatccactag taacggccgc 540 
cagtgtgctg gaattctgca gatatccatc acacCggcgg ccgctcgagc atgcatctag 600 
agggccgcat catgtaatta gttatgtcac gcttacattc acgccctccc cccacatccg 660 
ctctaaccga aaaggaagga gttagacaac ctgaagtcta ggtccctatt tattttttta 720 
tagttatgtt agtattaaga acgttattta tatttcaaat ttttcttttt tttctgtaca 780 
gacgcgtgta cgcatgtaac attatactga aaaccttgct tgagaaggtt ttgggacgct 840 
cgaaggcttt aatttgcggc cctgcattaa tgaatcggcc aacgcgcggg gagaggcggt 900 
ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg 960 
ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg 1020 
gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa agcccaggaa ccgtaaaaag 1080 
gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 1140 
cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct 1200 
ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc 1260 
tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg 1320 
gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc 1380 
tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca 1440 
ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag 1500 
ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct 1560 
ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc 1620 
accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga 1680 
tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca 1740 
cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 1800 
taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac i860 
caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 1920 
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gcctgactcc ccgccgtgta gataactacg acacgggagc gcttaccatc tggccccagt 1980 
gccgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag 2040 
ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc cattcagcct 2100 
attaactgtt gccgggaagc tagagtaagc agttcgccag ttaatagttt gcgcaacg^c 2160 
gccggcattg ccacaggcat cgtggtgtca ctctcgtcgt ctggcatggc ttcattcagc 222C 
tccggttccc aacgaccaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggct 2280 
agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtc atcactcatg 2340 
gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg 2400 
actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct 2460 
tgcccggcgt caatacggga taatagtgca tcacatagca gaactttaaa agtgctcatc 2520 
attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt 258 0 
tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt 2S40 
tccgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg 2700 
aaatgttgaa tactcatact cttccttttt caacgggtaa taactgatat aattaaattg 2760 
aagctctaat ttgtgagttt agcatacatg catttactta taatacagtt ttttagtttt 2820 
gctggccgca tcttctcaaa tatgcttccc agcctgcttt tctgtaacgt ccaccctcta 2880 
ccttagcatc ccttcccttt gcaaatagtc ctcttccaac aataataatg tcagatcctg 2940 
tagagaccac atcatccacg gttctatact gttgacccaa tgcgtctccc tcgtcatcta 3000 
aacccacacc gggcgtcata atcaaccaat cgtaaccttc atctcttcca cccatgtctc 3060 
tttgagcaat aaagccgata acaaaatctt tgtcgctctt cgcaacgtca acagtaccct 3120 
tagtatattc tccagtagat agggagccct tgcatgacaa ttctgctaac atcaaaaggc 3180 
ccccaggttc ctttgttact tcttctgccg cctgcttcaa accgctaaca atacctgygc 3240 
ccaccacacc gcgtgcattc gtaatgcctg cccattctgc tattctgtat acacccgcag 3300 
agtactgcaa tttgactgta ttaccaatgt cagcaaattt tctgtcttcg aagagtaaaa 3360 
aattgtactt ggcggataat gcctttagcg gcttaactgt gccctccatg gaaaaaccag 3420 
tcaagatatc cacatgtgtt tttagtaaac aaattttggg acctaatgct tcaactaact 3480 
ccagtaattc cttggtggta cgaacatcca atgaagcaca caagtttgtt tgcttttcgt 3540 
gcacgatatt aaacagctcg gcagcaacag gactaggatg agtagcagca cgttccttat 3600 
atgtagcttt cgacatgatt tatcttcgtt tcctgcaggt ttttgttctg tgcagttggg 3660 
ttaagaatac tgggcaattt catgtttctt caacactaca tatgcgtata tataccaatc 3720 
taagtctgtg ctccCtcctt cgctcttcct tctgttcgga gattaccgaa ccaaaaaaat 3780 
ttcaaagaaa ccgaaatcaa aaaaaagaat aaaaaaaaaa tgatgaattg aattgaaaag 3840 
ctagcttatc gatgataagc tgtcaaagat gagaattaat tccacggact atagactata 3900 
ctagatactc cgtctactgt acgatacact tccgctcagg tccttgtcct tcaacgaggc 3960 
cttaccactc ttttgttact ctattgatcc agctcagcaa aggcagtgtg acctaagatt 4020 
ctatcttcgc gatgtagtaa aactagctag accgagaaag agactagaaa cgcaaaaggc 4O80 
acttctacaa cggctgccat cattattatc cgatgtgacg ctgcagcttc tcaatgatat 4140 
tcgaatacgc tttgaggaga tacagcctaa tatccgacaa actgtcttac agatttacga 4200 
tcgtacttgt tacccatcat tgaatcttga acatccgaac ctgggagttt tccctgaaac 4260 
agatagtata tttgaacctg tataataata tatagtctag cgcttcacgg aagacaatgt 4320 
atgtatttcg gttcctggag aaactattgc atctattgca taggtaatct tgcacgtcgc 4380 
atccccggtt cattttctgc gtttccatct tgcacttcaa tagcatatct ttgttaacga 4440 
agcatctgtg cttcattttg tagaacaaaa atgcaacgcg agagcgctaa cttttcaaac 4500 
aaagaatctg agctgcattt ttacagaaca gaaatgcaac gcgaaagcgc tattttacca 4560 
acgaagaatc tgtgcttcat ttttgtaaaa caaaaatgca acgcgacgag agcgctaatt 4620 
tttcaaacaa agaatctgag ctgcattttt acagaacaga aatgcaacgc gagagcgcta 4680 
tcccaccaac aaagaatcta tacttctttt ttgttctaca aaaatgcatc ccgagagcgc 4740 
tattcttcta acaaagcatc ttagattact tttcttcCcc tttgtgcgct ctataatgca 4800 
gtctcttgat aactttctgc actgtaggtc cgctaaggtt agaagaaggc tactttggtg 4860 
tctattttct cttccacaaa aaaagcctga ctccacttcc cgcgtttact gattactagc 4920 
gaagctgcgg gtgcatcttt tcaagataaa ggcatccccg attatattct ataccgatgt 4980 
ggattgcgca tactttgtga acagaaagtg atagcgttga tgattcttca ctggtcagaa 5040 
aatcatgaac ggtttcttct actttgtctc tatatactac gtataggaaa cgtctacact 5100 
ttcgtattgt tttcgactca ctctacgaac agttcttact acaatCCCtc tgtctaaaga 5160 
gtaatactag agataaacat aaaaaatgta gaggtcgagt ttagatgcaa gctcaaggag 5220 
cgaaaggtgg atgggtaggt tatataggga tatagcacag agatatatag caaagagata 5280 
ctttcgagca acgtttgtgg aagcggtatt cgcaatggga agctccaccc cggttgacaa 5340 
tcagaaaagc cccaaaaaca ggaagattgt ataagcaaac atctaaactg taaacgttaa 54 00 
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tatttcgtta aaattcgcgt taaatttttg ttaaatcagc tcatttttta acgaatagcc 5460 
cgaaatcggc aaaatccctt ataaatcaaa agaatagacc gagatagggt tgagtgttgt 5520 
tccagtttcc aacaagagtc cactattaaa gaacgtggac tccaacgtca aagggcgaaa 5580 
aagggtctat cagggcgatg gcccactacg tgaaccatca ccctaatcaa gttttttggg 564 0 
gtcgaggtgc cgtaaagcag taaatcggaa gggtaaacgg atgcccccat ttagagcttg 5700 
acggggaaag ccggcgaacg tggcgagaaa ggaagggaag aaagcgaaag gagcgggggc 5760 
tagggcggtg ggaagtgtag gggtcacgct gggcgtaacc accacacccg ccgcgcttaa 5820 
cggggcgcca cagggcgcgt ggggatgatc cactagt 5857 



<210> 20 
<211> 403 
<212> DNA 

<213> Trlchodeima reesei 



<220> 

<223> (1-403) T. reesei hfb2 coding sequence 



<220> 

<221> intron 
<222> (131) . . (200) 



<220> 

<221> intron 

<222> (287) . . (358) 



<400> 20 

atgcagttct Ccgccgtcgc cctcttcgcc accagcgccc tggctgctgt ctgccctacc 60 

ggcctcttct ccaaccctct gtgctgtgcc accaacgtcc tcgacctcat tggcgttgac 120 

tgcaagaccc gtatgttgaa ctccaatctc cgggcatcct gacattggac gatacagttg 180 

acttacacga tgctttacag ctaccatcgc cgtcgacact ggcgccatct tccaggctca 240 

ctgtgccagc aagggctcca agcctctttg ctgcgttgct cccgtggtaa gtagtgctcg 300 

caatggcaaa gaagtaaaaa gacatttggg cctgggatcg ctaactcttg atatcaaggc 360 
cgaccaggct ctcctgtgcc agaaggccat cggcaccttc taa 403 



<210> 21 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 21 

cggaggagct cgacgacttc gagcagcccg agctgcacgc aggctgtctg ccctaccgg 59 



<210> 22 
<211> 29 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR 3' primer 



<400> 22 

tcattggatc cttagaaggt gccgatggc 



29 
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<210> 23 
<211> 679 
<212> DNA 

<213> Schizophyllum commune 
<220> 

<223> (1-679) SC3 coding sequence 
<220> 

<223> (1-92) 1st cDNA 
<220> 

<223> (146-183) 2nd cDNA 
<220> 

<223> (240-317) 3rd cDNA 
<220> 

<223> (374-469) 4th cDNA 
<:220> 

<=223> (524-586) 5th CDNA 
<220> 

<223> (635-679) 6th cDNA 
<400> 23 

atgttcgccc gtctccccgt cgtgttcctc tacgccttcg tcgcgttcgg cgccctcgtc 60 
gctgccctcc caggtggcca cccgggcacg acgtacgtcg acctctcacc gtcctctaat 120 
gtcttgctga tgaagccccg tatagcacgc cgccggttac gacgacggtg acggtgacca 180 
cggtgagtag ctttctcgcc gtcgacgact cgaacgcatt ggctaatttt tgctcatagc 240 
cgccctcgac gacgaccatc gccgccggtg gcacgtgtac tacggggtcg ctctcttgct 300 
gcaaccaggt tcaatcggta cgtacatcaa agcggcacga ccaggcatct cagctgacgg 360 
ccacatcgta caggcgagca gcagccctgt taccgccctc ctcggcctgc tcggcattgt 420 
cctcagcgac ctcaacgttc tcgttggcat cagctgctct cccctcactg tgagatcttt 480 
ttgttcactg tcccaattac tgcgcactga cagactctgc caggtcatcg gtgtcggagg 540 
cagcggctgt tcggcgcaga ccgtctgctg cgaaaacacc caattcgtat gtatactttc 600 
catgcgtgtc cctttctccg ctaatcatct gtagaacggg ctgatcaaca tcggttgcac 660 
ccccatcaac atcctctga 679 



<210> 24 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 



<400> 24 

actacacgga ggagctcgac gacttcgagc agcccgagct gcacgcaggg tggccacccg 60 

ggc 63 
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<210> 25 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 25 

tcgtacggat cctcagagga tgttgatggg 30 

<210> 26 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 26 

ggaattccgc ggactgcgca tcatgaagtt cttcgccatc gcc 43 

<210> 27 
<211> 80 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 27 

tgaattccat atgttaggta ccaccggggc ccatgccggt agaagtagaa gccccgggag 60 
caccgacggc ggtctggcac 80 

<210> 28 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 28 

tgaattcggt acccaggctt gctcaagcgt c 31 

<210> 29 
<211> 34 
<212> DNA 

<2i3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
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<400> 29 

tgaattccat atgtcacagg cactgagagt agta 34 



<2io> 30 

<211> 48 
<212> DNA 

<213> Artificial Sequence 
<22Q> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 30 

gaattcggta ccctcgtccc tcgcggtccc gccgaagtga acctggtg 4 8 



<210> 31 
<211> 34 
<2X2> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 31 

tgaattccat atgctaaccc cgtttcatct ccag 34 



<210> 32 

<211> 91B 

<2I2> DNA 

<213> Trichoderma reesei 
<220> 

<22i> terminator 

<222> (1) . . (918) 

<223> T. reesei hfbl terminator 

<4D0> 32 

gatgcccgcc cggggtcaag gtgtgcccgt gagaaagccc acaaagtgtt gatgaggacc 60 
atttccggta ctgggaaagt tggctccacg tgtttgggca ggtttgggca agttgtgtag 120 
atattccatt cgtacgccat tcttattctc caatatttca gtacactttt cttcataaat 180 
caaaaagact gctattctct ttgtgacatg ccggaaggga acaattgctc ttggtctctg 240 
ttatttgcaa gtaggagtgg gagattcgcc ttagagaaag tagagaagct gtgcttgacc 300 
gtggtgtgac tcgacgagga tggactgaga gtgttaggat taggtcgaac gttgaagtgt 360 
atacaggatc gtctggcaac ccacggatcc catgacttga tgcaatggtg aagatgaatg 420 
acagtgtaag aggaaaagga aatgtccgcc ttcagctgat atccacgcca atgatacagc 480 
gatatacctc caatatctgt gggaacgaga catgacatat ttgtgggaac aacttcaaac 540 
agcgagccaa gacctcaata tgcacatcca aagccaaaca ttggcaagac gagagacagt 600 
cacattgtcg tcgaaagatg gcatcgtacc caaatcatca gctctcatta tcgcctaaac 660 
cacagattgt ttgccgtccc ccaactccaa aacgttacta caaaagacat gggcgaatgc 720 
aaagacctga aagcaaaccc tttttgcgac tcaattccct cctttgtcct cggaatgatg 780 
atccttcacc aagtaaaaga aaaagaagat tgagataata catgaaaagc acaacggaaa 840 
cgaaagaacc aggaaaagaa taaatctatc acgcaccttg tccccacact aaaagcaaca 900 
gggggggtaa aatgaaat 918 
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<210> 33 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 33 

gacctcgatg cccgcccggg gtcaag 26 



<210> 34 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; PCR 3' primer 
<400> 34 

gtcgacattt cattttaccc ccctcg 26 



<210> 35 
<21X> 1190 
<212> DNA 

<213> Tri choderma reesei 



<220> 

<221> promoter 

<222> (1) . . (1190) 

<223> T. reesei h£b2 promoter 



<400> 35 

ctcgagcagc tgaagcttgc atgcctgcat 
acactgccgt cgacgtctct cttccgacct 
cgctttgtat tattagagta tatgcaagtc 
aggacgatcg cggcacgata cgccctcgtt 
ggagcaagat tcagcccaag ggagatggac 
ccactccctt tggccaaatc agcttgcatg 
actaacaacc agggatggga aacgaagcct 
gagagaggga gagagacaat gagtgccaca 
aatgtcacat ccgagtcttg gggcctctgt 
aatagtcctc tttcttgagg actggatacc 
tttcgaagta ttatctggag gatagaagac 
ttggggagta gcccaacgaa aggtaactcc 
cagcctcttg gagccgtcat agatcacatc 
ctcatcatgg ccacatacta ctacatacgt 
caagacacca aggccccagc atcaagcctc 
gagagattgc gatgagccct ctccctacct 
agccaacgtg atgaaaacaa ggacatgagg 
ggccagaaaa aaaagtataa agaagacctc 
ctcatcagca cactcatcta caaccatcac 
tccaaacaca aacattcttt gttgaatacc 



cctttgtgag cgactgcatc cattttgcac 60 
tggccagctg gacaagcaac acaccaatga 120 
tcaggactat cgactcaact ctacccaccg 180 
ctcattggcc caagcagacc aactgcccct 24 0 
ggcagggcac gccaggcccc caccaccaag 3Q0 
tcaagagaca tcgagctgtg ccttgaaatt 360 
gcttttggaa agacaacaat gagagagaga 420 
aacctggtag tgctccgcca atgcgtctga 480 
gagaatgtcc agagtaatac gtgttttgcg 540 
tacgataccc tttttgagtt gaCgcggtgc 600 
gtctaggtaa ctacacaaaa ggcctatact 660 
tacggcctct tagagccgtc atagatccta 720 
tgtgtagacc gacattctat gaataatcat 780 
gtctctgcct acctgacacg tagcagtggc 840 
cctacctatc ccttccattg tacagcggca 900 
acagacggct gacaatgtcc gtataccacc 96 0 
aacagcctgc gagagctgga agatgaagag 1020 
gattcccgcc atccaacaat ctcttccatc 108 0 
cacattcact caactcctct ttctcaactc 1140 
aaccatcacc acctttcaag 1190 
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<210> 36 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 36 

aagcttgcat gcctgcatcc 20 



<210> 37 
<2ll> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3* primer 
<400> 37 

ccatggtgaa aggtggtgat ggttgg 26 



<210> 38 
<211> 13 
<212> PRT 

<213> Trichoderma reesei 
<220> 

<223> vild type T. reesei EGI peptide linker 
<400> 38 

Val Pro Arg Gly Ser Ser Ser Gly Thr Ala Pro Gly Gly 
15 10 



<210> 39 
<211> 10 
<:212> PRT 

<:213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: modified CBHII linker 
<400> 39 

Gly Ser Ser Ser Gly Thr Ala Pro Gly Gly 
15 10 



<210> 40 

<211> 19 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Met /Thrombin 
linker 

<400> 40 

Pro Gly Arg Pro Val Leu Thr Gly Pro Gly Met Gly Thr Ser Thr Ser 
15 10 15 

Ala Gly Pro 



<210> 41 
<2ll> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Met -containing linker 

<400> 41 

Pro Gly Ala Ser Thr ser Thr Gly Met Gly Pro Gly Gly 
15 10 



<210> 42 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; linker containing the thrombin 
cleavage site 

<400> 42 

Gly Thr Leu Val Pro Arg Gly Pro Ala Gly Val Asn Leu Val 
15 10 



